Introduction:

Chronic lymphocytic leukemia (CLL) and small lymphocytic lymphoma (SLL) are closely related diseases with similar characteristics, classified as mature B cell neoplasms. They involve the accumulation of monoclonal B lymphocytes and are considered as one disease with different manifestations. CLL is more prevalent and is usually detected through routine blood tests, while SLL is identified by primary lymph node involvement. Although many patients with CLL/SLL are asymptomatic, some may experience lymphoma-related symptoms. The diagnosis involves blood tests and immunophenotypic analysis. CLL prognosis varies, and staging systems help in determining the appropriate treatment regimen. Machine learning (ML) is being increasingly used in hematology to aid in diagnosis, treatment, and risk-stratification. This review explores the application of ML in the classification and diagnosis of CLL, discussing the performance and limitations of existing models, with the goal of encouraging further research and integration into clinical settings to improve patient care.

Materials and Methods:

A literature search was conducted using the PubMed/MEDLINE and EMBASE databases. EndNote and Rayyan software were used to eliminate duplicate entries and for screening the articles. In addition, the references of the identified articles were manually screened to identify additional relevant studies. Articles with primary data pertaining to the use of ML in CLL/SLL diagnosis and classification were included without language or time restrictions.

Results:

A total of 169 articles were identified and duplicates were removed using Endnote® and Rayyan® software, resulting in 149 articles eligible for screening. The included articles met specific criteria: they utilized ML methods, reported conclusions on the method's reliability or accuracy, and focused on CLL diagnosis and classification. Excluded articles were non-English, animal/in vitro studies, abstracts, and review articles. After screening, 14 studies met the inclusion criteria.

Studies examining the potential of ML algorithms in enhancing the diagnosis of CLL used various data sources like blood smears, flow cytometry, genetic data, and histopathological images. The current evidence suggests that ML can effectively predict CLL diagnosis, aid in screening, identify potential biomarkers, and explore the underlying molecular mechanisms. Many of the ML models evaluated in this review demonstrate high accuracy, ranging from 83% to 100%. Implementing AI and ML in hematology can automate tasks involved in patient workup, risk assessment, and treatment, allowing hematologists to focus on critical aspects of patient care and research.

Despite the promising potential, some important considerations should be acknowledged. First, many reviewed models had limited sample sizes from single centers, limiting their generalizability. It's crucial to develop models using larger, diverse datasets from multiple centers to improve generalizability. Second, there is a lack of prospective studies and research on the impact of ML models on patient outcomes. Future investigations should focus on prospectively assessing the effect of ML models on CLL diagnosis, prognosis, and patient outcomes. Third, the integration of ML applications into direct patient care raises ethical and medico-legal concerns, including liability, data privacy, and doctor-ML application interaction. An ethical framework specific to ML applications in healthcare is needed, and doctors should receive training on ML applications to understand their capabilities and limitations.

Addressing these considerations can lead to successful implementation of ML applications in the care of CLL patients, improving diagnosis and patient outcomes.

Conclusion:

Our findings demonstrate the promising potential of ML algorithms in accurately predicting CLL diagnosis, conducting CLL screening, and identifying genetic biomarkers associated with CLL. Future research endeavors should prioritize the development of large, standardized datasets to effectively train ML models, conduct prospective evaluations to assess their performance, explore their impact on patient outcomes, and establish ethical frameworks to govern their utilization.

No relevant conflicts of interest to declare.

This content is only available as a PDF.
Sign in via your Institution